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Abstract 


Three-dimensional pictorial displays incorporating depth cues by 
means of stereop sis offer a potential means of presenting information in 
a natural way to enhance situational awareness and improve operator 
performance. Conventional computational techniques rely on asymp- 
totic projection transformations and symmetric clipping to produce the 
stereo display. Implementation of two new computational techniques, an 
asymmetric clipping algorithm and a piecewise linear projection trans- 
formation , provides the display designer with more control and better 
utilization of the effective depth-viewing volume to allow full exploita- 
tion of stereopsis cuing. Asymmetric clipping increases the perceived 
field of view (FOV) for the stereopsis region. The total horizontal FOV 
provided by the asymmetric clipping algorithm is greater throughout the 
scene viewing envelope than that of the symmetric algorithm. The new 
piecewise linear projection transformation allows the designer to cre- 
atively partition the depth-viewing volume , with freedom to place depth 
cuing at the various scene distances at which emphasis is desired. 


Summary 

Three-dimensional (3-D) pictorial displays incor- 
porating depth cues by means of stereopsis offer a 
potential means of presenting information in a natu- 
ral way to enhance situational awareness and improve 
operator performance. Conventional computational 
techniques rely on asymptotic projection transforma- 
tions and symmetric clipping to produce the stereo 
display. Implementation of two new computational 
techniques, an asymmetric clipping algorithm and a 
piecewise linear projection transformation, provides 
the display designer with more control and better 
utilization of the effective depth-viewing volume to 
allow full exploitation of stereopsis cuing. 

Stereo displays are created by generating both 
left-eye and right-eye views of the visual scene. Con- 
ventional computational techniques utilized in stere- 
opsis display generation rely on symmetric clipping 
algorithms and asymptotic projection transforma- 
tions to provide the left-eye and right-eye stereo pair. 
Clipping algorithms are used to limit the computed 
view to the display screen boundaries, For a fixed 
screen distance and size, symmetric clipping dictates 
a smaller monocular field of view (FOV) for each eye 
than asymmetric clipping. Combining the monocu- 
lar FOV for both eyes results in different stereo over- 
lap regions and single-eye viewing regions at differ- 
ent scene distances for the two clipping approaches. 
The perceived FOV for the stereopsis region and 
also the total horizontal FOV provided by the asym- 
metric clipping algorithm are greater throughout the 
scene viewing envelope than those of the symmetric 
algorithm. 


Conventional asymptotic projection transforma- 
tions, which are used to map the visual scene depth 
into the stereo viewing volume, allow the display de- 
signer to fix a specific scene distance at the screen 
location (i.e., the display designer may decide to set 
up the projection transformations such that objects 
150 ft away appear to be at the screen distance in 
the stereo depth- viewing volume, objects closer than 
150 ft appear to be in front of the screen, and ob- 
jects farther away than 150 ft appear to be behind 
the screen). The conventional asymptotic projection 
transformation also allows additional control by let- 
ting the designer specify a maximum distance any 
image is to appear from the viewer. This two-point 
control provided by the asymptotic projection trans- 
formation allows stereo emphasis to be placed only at 
one scene distance. However, the new piecewise lin- 
ear approach allows the designer to creatively parti- 
tion the depth-viewing volume, with freedom to place 
depth cuing at the various scene distances at which 
emphasis is desired. 

The implementation of these computational tech- 
niques furnishes the display designer with more 
control and better utilization of the effective depth- 
viewing volume to allow full exploitation of stere- 
opsis cuing in advanced flight display concepts that 
embody true 3-D images of synthetic (computer- 
generated) objects or scenes. 

Introduction 

Current electronic display technology can provide 
high-fidelity, “real world” pictorial displays (under 
flicker-free conditions) that incorporate true depth 



in the display elements. Advanced flight display 
concepts that embody true 3-D images of synthetic 
(computer-generated) objects or scenes are being 
conceived and evaluated at NASA Langley Research 
Center and at the Wright Research and Develop- 
ment Center (refs. 1-7). Innovative concepts are 
sought that exploit the power of modern graphics 
display generators and stereopsis, not only for situa- 
tional awareness enhancements of pictorial displays, 
but also for the declutter of complex informational 
displays and to provide more effective alerting func- 
tions to the flight crew. 

The intuitively advantageous use of a three- 
dimensional presentation of three-dimensional infor- 
mation, rather than the conventional two- 
dimensional presentation of such information, has 
been investigated for years within the flight display 
community (refs. 8 14). These efforts have been par- 
ticularly intense for helmet-mounted head-up display 
applications, as stcreopsis cuing is a feature of binoc- 
ular helmet systems (refs. 5 and 8 11). Additional in- 
vestigations utilizing electronic shutters or polarized 
filters, rather than helmet optics, to present separate 
left-eye and right-eye views have also been conducted 
(refs. 1-4, 6, 7, and 11-14). 

Stereoscopic displays provide depth information 
by means of lateral disparity and the eye muscu- 
lar cues associated with convergence. In these dis- 
plays, the distance that affects eye accommodation 
(focus) is the viewer-screen distance, which remains 
constant. Thus, the major depth cue missing in syn- 
thetic generation of stereo 3-D displays is the change 
in eye accommodation with fixation point depth. 
And this is indeed a major deficiency, for accommo- 
dation and convergence are highly associative. For 
a fixed accommodation distance, there is a limited 
range of convergence conditions that will result in 
comfortable, clear, fused, single vision. Thus, for a 
given viewer-screen distance for a stereoscopic dis- 
play, there are limits to the amount of lateral dis- 
parity that is usable. Recent experiments at NASA 
Langley Research Center (ref. 15) determined that 
the effective region of stereopsis cuing (the depth- 
viewing volume) for traditional directly viewed stereo 
3-D display systems is on the order of 2 to 3 ft. 

Stereo displays are created by generating both 
left-eye and right-eye views of the visual scene. Con- 
ventional computational techniques that are utilized 
in stereopsis display generation rely on symmetric 
clipping algorithms and asymptotic projection trans- 
formations to provide the left-eye and right-eye stereo 
pair. The goal of this effort was to expand upon these 
conventional techniques to provide the display de- 


signer with more complete control and better utiliza- 
tion of the effective depth-viewing volume to allow 
full exploitation of stereopsis cuing in advanced flight 
display concepts. This increased control is provided 
by the implementation of two new computational 
techniques, an asymmetric clipping algorithm and a 
piecewise linear projection transformation. Asym- 
metric clipping increases the perceived fields of view 
for the stereopsis regions. It also provides horizontal 
fields of view that are greater throughout the scene 
viewing envelope than those of the symmetric al- 
gorithm. The piecewise linear approach allows the 
designer to creatively partition the depth-viewing 
volume, with freedom to place depth cuing at the 
various scene distances at which emphasis is desired. 

Generation of Stereo 3-D Displays 

Modern computer graphics generators provide 
depth cues by creating for each eye a separate view 
of the visual scene by means of various hardware sys- 
tems, such that the right eye sees only the right-eye 
scene and the left eye sees only the left-eye scene 
(stereopsis cues). These hardware systems include 
helmet-mounted displays, which depend on a direct 
presentation of each eye view, stereoscopes (refract- 
ing or reflecting), and systems that incorporate shut- 
ters (electronics or mechanical) (fig. I), or filters (po- 
larized or color). 

Regardless of the display hardware system, graph- 
ics software is necessary to create the left-eye and 
right-eye stereo pair images. To perform this task, 
the graphics computer resolves the single-viewpoint 
visual data base into the desired stereo pair. In order 
to produce synthetic graphic objects that appear to 
have depth in 3-D space, it is necessary to displace 
the objects in each stereo pair image so that each 
eye will view the scene with the parallax required to 
achieve the desired depth. Figure 2 illustrates the 
parallax concept employed to produce objects that 
appear behind the monitor screen. Figure 3 illus- 
trates the concept as it is employed to produce ob- 
jects at various depths. To present an object that 
appears at the depth of the screen, both of the stereo- 
pair views are drawn in an overlapping manner so it 
appears that only one image is displayed. For objects 
to appear behind the screen, the object is displaced 
to the left for the left-eye view and to the right for 
the right-eye view, with the displacement reaching a 
maximum value to place an object at infinity. For 
objects to appear in front of the screen, the images 
are crossed so that the left-eye view is displaced to 
the right and the right-eye view is displaced to the 
left. 
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Figure 1. Stereo 3-D hardware system. 
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Figure 2. Concept for introducing depth. 
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Figure 3. Top view of the geometric principle for producing left- and right-eye views. 


Graphics software is used to generate the lateral 
displacement, which is known as lateral disparity (see 
fig. 4 for the geometric relationships involved). First, 
the left-eye and right-eye coordinate systems are 
created as offsets from the viewer coordinate system 
of the visual scene. Clipping is then employed to 
limit each eye view to the display surface boundaries. 
Finally, simple perspective division (ref. 16) is used to 
transform the three-dimensional viewing volumes to 
two-dimensional viewports, wdiose centers are offset 
from the center of the display screen by half of the 
maximum lateral disparity. 

Enhanced Stereo 3-D Algorithms 

Conventional computational techniques utilized 
in stereopsis display generation rely on symmetric 
clipping algorithms and asymptotic projection trans- 
formations to provide the left-eye and right-eye stereo 
pair images. The clipping algorithms are used to 
limit the computed view to the display screen bound- 
aries, and the projection transformations are used to 
map the visual scene into the stereo viewing volume. 

Implementation of two new computational tech- 
niques, an asymmetric clipping algorithm and a 
piecewise linear projection transformation, provides 


the display designer with more complete control and 
better utilization of the effective depth-viewing vol- 
ume. This increased flexibility allows better exploita- 
tion of stereopsis cuing. 

Stereo 3-D Clipping 

Figure 5 presents an illustration of symmetric and 
asymmetric clipping, while table I presents the effects 
of each algorithm on the perceived horizontal FOV. 
With symmetric clipping (shown in the figure as a 
top view for the left eye), the left side of the left-eye 
viewing volume is determined by the left edge of the 
display surface. The right side of the left-eye view- 
ing volume is determined by reflecting the left side 
about the sight vector, thus producing symmetric 
angles. In stereo 3-D displays, the left-eye and right- 
eye projections are offset from the screen centerline, 
producing unused blank spots on the display sur- 
face if symmetric clipping is employed. For a screen 
distance of 19 in. and a 40° horizontal FOV, these 
blank spots generate two 7° monocular regions on 
each edge of the monitor, with an overlap stereo 3-D 
region of 26° . This is not the most efficient use of the 
available screen width. Asymmetric clipping uses the 
entire display surface for each eye view. The left side 
of the left-eye viewing volume is determined by the 




Figure 5. Stereo 3-D clipping algorithms. 
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Table I. Perceived Horizontal Field-of-View for Symmetric and Asymmetric Clipping 


Screen distance — 19 in. 
Horizontal monitor width = 12.4 in. 
Interocular distance = 2.5 in. 


Symmetric clipping Asymmetric clipping 



Perceived horizontal FOV, deg 

Scene 

object 

distance 

Left 

monocular 

Stereo 

Right 

monocular 

Total 

Nearest 

36.0 

0 

36.0 

72.0 

Screen 

0 

40.0 

0 

40.0 

Infinity 

6.65 

33.2 

6.65 

46.5 


Scene 

object 

distance 

Perceived horizontal FOV, deg 

Left 

monocular 

Stereo 

Right 

monocular 

Total 

Nearest 

30.8 

0 

30.8 

61.6 

Screen 

7.0 

26.0 

7.0 

40.0 

Infinity 

0 

33.2 

0 

33.2 


left edge of the display surface, just as in symmet- 
ric clipping. The right side of the viewing volume, 
however, is determined by the right edge of the dis- 
play surface. This generates unequal, or asymmet- 
ric, angles about the sight vector. With asymmetric 
clipping, there are no unused portions of the display 
surface, with the stereo 3-D overlap region encom- 
passing the entire 40° of the FOV. 


Symmetric clipping thus dictates a smaller monoc- 
ular FOV for each eye for a fixed screen distance and 
size. Combining the monocular fields of view for both 
eyes results in different stereo overlap regions and 
single-eye viewing regions at the screen and at differ- 
ent scene distances for the two clipping approaches. 
Table I presents perceived fields of view for the two 
algorithms. The horizontal FOV usually considered 
for nonstereo displays is that of the display surface, as 
measured from the midpoint between the eyes. How- 
ever, for stereoscopic displays, the perceived FOV 
(e.g., with asymmetric clipping, the angle between 
the left boundary for the right eye and the right 
boundary for the left eye as measured from the mid- 
point between the eyes) varies with the depth of fixa- 
tion within the viewing volume. For image planes in 
front of the screen, the total perceived FOV is larger 
than that at screen distance, while the total perceived 
FOV for image planes near infinity is smaller. Asym- 
metric clipping provides increased fields of view for 
both the stereo region and the total scene through- 
out the depth- viewing volume. As shown in the 
table, asymmetric clipping provides greater per- 
ceived fields of view for the stereopsis regions and 
greater total horizontal fields of view T throughout the 
scene viewing envelope than those of the symmetric 
algorithm. 


Stereo 3-D Projection Transformations 

It is well known from psychophysical research that 
the effective range of human stereo vision is sev- 
eral hundred feet (ref. 17). In synthetically gener- 
ated views of real-world scenes that are to be dis- 
played stereoscopically, it is necessary to map the 
real-world depth cues of hundreds of feet into a vir- 
tual volume that is only several feet in depth (ref. 15). 
Figure 6 illustrates the mapping of a “real world 1 ' 
scene to the stereo viewing volume. Conventional 
asymptotic projection transformations allow the dis- 
play designer to fix a specific scene distance at the 
screen location in the viewing volume. In figure 6, a 
real-world distance of 150 ft has been selected as the 
scene distance to map to the screen distance in the 
virtual volume. All objects closer than 150 ft from 
the viewer in the real-world scene will appear in front 
of the screen (objects such as the tree, located 100 ft 
away), whereas all objects farther away than 150 ft 
will appear behind the screen (e.g., the aircraft, lo- 
cated 200 ft away). The top portion of figure 7 illus- 
trates the shape of the mapping curve as it is applied 
to a general case. Control of the real-world scene 
distance to screen location intercept and control of 
the maximum allowed virtual image distance allow 
some limited shaping of the curve, which can be used 
to place stereo emphasis over a single desired real- 
world range. If stereo emphasis is required over mul- 
tiple real-world ranges, or over a large single range, 
the conventional asymptotic projection transforma- 
tion will not be adequate. An example for this type of 
requirement could be a telerobotic arm operator per- 
forming a “peg in the hole” type of task. In this task, 
the telerobotic arm operator uses a remotely located 
robot arm to retrieve a peg from a hole and place it 
in another hole located farther away. If the operator 
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Figure 6. Scene- to-screen mapping problems with conventional stereo technology. 



New piecewise linear transformations allow 
arbitrary partitioning of depth-viewing volume 



Distance cues by size, perspective, and other monocular cues 
Figure 7. Visual scene mapping to stereo 3-D viewing volumes. 




is viewing a stereo 3-D presentation of the scene, it 
could be advantageous to have stereo emphasis at the 
perceived depth of each of the two holes. W ith the 
conventional asymptotic projection transformation, 
stereo emphasis could be placed only at the depth of 
one of the holes. However, with the new piecewise 
linear approach, the stereo viewing volume could be 
partitioned into two regions where stereo emphasis 
could be applied. The bottom half of figure 7 illus- 
trates this concept. Objects in stereo region 1 (the 
first hole, and the peg at the start of the task) and 
objects in stereo region 2 (the second hole, and the 
peg at the end of the task) would be presented with 
stereo depth cues, while objects outside these two re- 
gions would only contain monocular depth cues (i.e., 
perspective, and interposition). The piecewise linear 
projection transformation allows the display designer 
to creatively partition the depth- viewing volume into 
multiple regions where the stereo depth mapping can 
be independently controlled. This gives the designer 
the freedom to place depth cuing at the various scene 
distances at which emphasis is desired. 

Implementation 

Both of these new techniques could be imple- 
mented in true algorithmic form by passing every 
display vertex in the scene through a series of equa- 
tions. However, because most modern graphics gen- 
erators are designed to transform vertices with a 4 x 4 
transformation matrix, the speed of the algorithms 
would be greatly increased if they were put into ma- 
trix form. In fact, since stereo 3-D depth mapping 
and clipping are so closely related, both can be imple- 
mented in a single projection transformation matrix. 
Appendix A shows the derivation and construction of 
the asymptotic/asymmetric projection matrix, and 
appendix B shows the derivation and construction 
of the piecewise lincar/asymmetric projection ma- 
trix. Symmetric clipping is rarely desired and thus 
its implementation is not discussed. The mechanics 
of the transformation of the 3-D scene data base by 
using one of the above-mentioned projection trans- 
formation matrices is discussed in section 6.5 of ref- 
erence 16. The generation of left-eye and right-eye 
images requires that the scene data base be rendered 
twice, once with a left-eye projection transformation 
matrix and once with a right-eye projection trans- 
formation matrix. When the two stereo pair images 
are presented to the appropriate eye (the left eye sees 


only the left-eye image, and the right eye sees only 
the right-eye image), the viewer will perceive a stereo 
3-D representation of the scene data base. 

The decisions of which projection to use (asymp- 
totic or piecewise linear) and how to manipulate the 
projection, arc task dependent. The asymptotic pro- 
jection will produce a more natural looking presen- 
tation of a real-world scene than the piecewise linear 
projection. A simple rule to follow could be if the 
purpose of the stereo 3-D presentation is to produce a 
more natural or realistic representation of a scene, or 
if the task involved can be accomplished with stereo 
emphasis in one region (i.e., a formation-flying or 
station-keeping type of task where the asymptotic 
screen intercept value is set to the desired formation 
or station distance), use an asymptotic/asymmetric 
projection. Although the piecewise linear projection 
technique is more flexible and can make better use 
of the stereo 3-D depth-viewing volume, the result- 
ing stereo 3-D display will have an artificial feel. As 
implemented in this paper, the asymptotic projec- 
tion can be used successfully in many applications. 
The piecewise linear projection should be reserved 
for those occasions when the limits of an asymptotic 
projection preclude the ability to easily or safely per- 
form the desired task. 

Concluding Remarks 

The implementation of two new computational 
techniques, an asymmetric clipping algorithm and a 
piecewise linear projection transformation, provides 
the display designer with more complete control and 
better utilization of the effective stereo 3-D depth- 
viewing volume. These enhancements allow full ex- 
ploitation of stcreopsis cuing. Asymmetric clipping 
increases the perceived fields of view for the stere- 
opsis regions, and also, the total horizontal fields of 
view provided by the asymmetric clipping algorithm 
are greater throughout the scene viewing envelope 
than those of the symmetric algorithm. The new 
piecewise linear projection transformation allows the 
designer to creatively partition the depth-viewing 
volume, with freedom to place depth cuing at the 
various scene distances at which emphasis is desired. 


NASA Langley Research Center 
Hampton, VA 23681-0001 
June 11, 1992 
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Appendix A 

Asymptotic/ Asymmetric Projection 
Transformation Matrix 

The asymptotic/asymmetric projection transfor- 
mation matrix is a 4 x 4 element matrix that is used to 
convert a 3-D scene data base into a 2-D perspective 
representation for the left-eye and right-eye images of 
a stereo pair. The matrix is built based on a right- 
handed viewer coordinate system with the origin lo- 
cated at the viewpoint in the scene data base. The 
x-axis has positive direction to the right, the y-axis 
is positive up, and the z-axis is positive behind the 
viewer. All vertices in the data base are represented 
in homogeneous (x, y, z, w) form. The homogeneous 
coordinate w is a scale parameter and is initially set 
to 1. If, after any of the steps in the projection trans- 
formation, the w coordinate is a value other than 1, 
it is forced to unity by dividing the vertex coordi- 
nates (x, y, z, and w) by the nonunity w coordinate 
value. For a more complete discussion of the homo- 
geneous coordinate system and 3-D geometric trans- 
formations, viewing, and projections, please refer to 
chapters 5 and 6 in reference 16. 

To generate the two eye views, left-eye and right- 
eye matrices are constructed. The scene data base 
is then rendered twice, once after being transformed 
by the left-eye projection matrix and once after be- 
ing transformed by the right-eye projection matrix. 
When the two stereo pair images are presented to 
the appropriate eye (the left eye secs only the left-eye 
image, and the right eye sees only the right-eye im- 
age), the viewer will perceive a stereo 3-D represen- 
tation of the 3-D scene data base that encompasses 
the advantages and properties of the asymptotic pro- 
jection transformation and the asymmetric clipping 
algorithm. This matrix is derived from eight param- 
eters, four of which describe the physical aspects of 
the presentation and four that determine the asymp- 
totic mapping and depth-viewing volume desired. 

Physical Parameters 

Interpupillary distance. The interpupillary 
distance (IPD) is the distance between the eyes of 
the person viewing the stereo 3-D presentation. 

Screen distance . The screen distance (SD) is 
the distance from the stereo 3-D viewer to the display 
surface. 

Width. The width (W) is the physical width of 
the display surface. 

Height. The height (H) is the physical height of 
the display surface. 


Asymptotic Mapping and Depth- Viewing 

Volume Parameters 

Minimum view distance . The minimum view T 
distance (MIND) is the minimum distance a virtual 
image is to appear from the viewer. As objects are 
presented closer and closer to the viewer, eventually 
a limit is reached at which the viewer can no longer 
comfortably fuse the stereo 3-D presentation into a 
single image (ref. 15). Setting the minimum view 
distance parameter to within this limit ensures that 
no objects will be too close to fuse. 

Maximum view distance. The maximum view 
distance (MAXD) is the maximum distance a virtual 
image is to appear from the viewer. As objects 
arc presented farther and farther away from the 
viewer, eventually a limit is reached at which the 
viewer can no longer comfortably fuse the stereo 3-D 
presentation into a single image (ref. 15). Setting 
the maximum view distance parameter to within this 
limit ensures that no objects will be too far away to 
fuse. As objects in the real-world scene approach 
an infinite distance away from the viewer, they will 
approach the maximum view distance in the virtual 
image. 

Screen intercept. The screen intercept (SI) is 
the distance in the real-world scene that is to appear 
at the screen distance in the virtual image. The 
region in the real world centered about this distance 
will have the most pronounced stereo effect. 

Far clipping plane. The far clipping plane 
(FCP) is the distance in the real-world scene beyond 
which objects will not be drawn. This is used to 
help speed up the drawing time of the display. Most 
objects get too small to see at great distances and the 
far clipping plane is used to eliminate these objects 
from the rendering process. 

Matrix Construction 

The asymptotic/asymmetric projection transfor- 
mation matrix is built in eight steps by combining ba- 
sic geometric transformation matrices. Each of the 
transformations is based on values calculated from 
one or more of the aforementioned parameters. 

Projection offset translation. The first step 
in the construction is to offset the centerline of the 
projection horizontally to generate either a left-eye 
or right-eye coordinate system. The amount of off- 
set is dependent on the viewer’s eye separation, the 
screen distance, the maximum view distance, and the 
screen intercept value. The projection offset transla- 
tion (POT) is the major transformation controlling 
what real-world scene distance is to appear at the 
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depth of the display surface in the stereo 3-D virtual 
image (screen intercept value). The relationship is 
described by the following equation: 


POT = 


IPD 

~Y~ 


x 


1 . 0 - 


SD \ 
MAXD ) 


x 


SI 

SD 


The derivation of this equation is discussed in a 
later section. The projection offset equation above is 
for the generation of the left-eye coordinate system. 
To generate the right-eye coordinate system, the 
projection offset translation is the negative of the 
left-eye value. The translation transformation matrix 
used for the projection offset is shown as matrix (Al): 


plane. This projection is achieved by dividing the 
x- and y-coordinates of the data base vertices by 
the vertex z-coordinate value. The screen distance 
scale step above has established the 2-D plane to be 
coincident with the display screen. To accomplish 
this division with a transformation matrix, a matrix 
that transposes the homogenous coordinate w with 
the vertex z-coordinate is used. This matrix is shown 
as matrix (A4): 


10 0 0 
0 10 0 
0 0 0 1 
0 0 10 


(A4) 


1 0 0 0 


[POT 0 0 1J 

z invert scale . In the right-handed coordi- 
nate system used for this construction, the posi- 
tive z-axis extends behind the viewer. In order for 
the 3-D to 2-D conversion (perspective division) to 
work correctly, the coordinate system must be a left- 
handed system with the positive z-axis extending in 
front of the viewer. This transformation is achieved 
by inverting the z-axis with a z-scale of — 1. The 
scale transformation matrix used for this operation 
is shown as matrix (A2): 


z-clip translation . After the perspective divide 
operation, the x- and y-coordinate values of the 
scene data base vertices represent the 2-D projection. 
The z-coordinate value has been transformed into its 
reciprocal. Two more transformations are necessary 
to bring the z-coordinate value within the z clipping 
range. The first of these is a translation. This z-clip 
translation (ZCT) is based on the near clipping plane 
and far clipping plane values and is the negative 
average of their reciprocals. The far clipping plane 
value is one of the eight parameters used in defining 
the projection; the near clipping plane (NCP) value 
must be calculated so that it is aligned with the 
minimum view distance: 


10 0 0 

0 10 0 

0 0-10 
0 0 0 1 


(A2) 


Nr . p _ (SI x MIND x MAXD) - (SI x SD x MIND) 
(SD x MAXD) - (SD x MIND) 


ZCT = — 


/ FCP + NCP \ 
\2.0 x FCP x NCP / 


Screen distance scale. When perspective divi- 
sion occurs, the 3-D data base is projected onto a 2-D 
plane. To maintain the correct stereo 3-D presenta- 
tion, this 2-D plane must be the same distance away 
as the display screen. To accomplish this, the x-y 
plane of the coordinate system is scaled by the screen 
distance. The scale transformation matrix used in 
this step is shown as matrix (A3): 


SD 0 0 0 

0 SD 0 0 
0 0 10 
0 0 0 1 


(A3) 


Perspective division . The perspective division 
step projects the 3-D scene data base onto a 2-D 


The derivation of the NCP equation is discussed 
in a later section. For a discussion of the effect of 
the near and far clipping planes on the z-clip trans- 
lation, please refer to chapter 6 of reference 16. The 
z-clip translation transformation matrix is shown as 
matrix (A5): 


10 0 0 
0 10 0 
0 0 10 
0 0 ZCT 1 


(AS) 


z-clip scale . The second of the z-coordinate 
transformations necessary to bring the z-coordinate 
value into the z clipping range is a scale trans- 
formation. Like the z-clip translation, the z-clip 


scale (ZCS) is based upon the near and far clipping 
plane values: 


ZCS = 


/ 2.0 x FCP x NCP \ 
V FCP - NCP ) 


For a discussion on how the near and far clipping 
planes affect the 2 -clip scale, please refer to chapter 6 
in reference 16. The 2 -clip scale transformation 
matrix is shown as matrix (A6): 


10 0 0 
0 10 0 
0 0 ZCS 0 
0 0 0 1 


(A6) 


Viewport separation . The separation between 
the left-eye and right-eye viewports controls the max- 
imum positive lateral disparity. As an object in the 
real-world scene approaches an infinite distance away 
from the viewer, the object approaches the center 
of the projection, and the center of the projection 
appears in the center of the viewport. Thus, the 
viewport separation controls where objects at in- 
finity in the real-world scene appear in the virtual 
image (maximum view distance). The relationship 
between maximum view distance and viewport sep- 
aration translation (VPT) is as follows: 


VPT = — 


IPD 


1 . 0 - 


SD \ 
MAXD ) 


The derivation of this equation is discussed in a later 
section. The equation above gives the viewport sepa- 
ration translation amount for the left-eye image. The 
right-eye value is the negative of the left-eye value. 
The viewport separation translation transformation 
matrix is shown as matrix (A7): 


1 0 0 0 
0 10 0 
0 0 10 
VPT 0 0 1 


(A7) 


Field- of -view scale . To maintain the proper 
aspect ratio and fields of view, the image plane that 
contains the 2-D projection of the 3-D scene data 
base must be scaled based on the physical constraints 
of the presentation. The three physical parameters 
that determine FOV and aspect ratio are the width 
and height of the display surface, and the viewer- 
to-screen distance. The screen distance effect has 


already been accounted for in the screen distance 
scale step. The display surface width and height 
must now be accounted for. This is accomplished 
by scaling the ^-coordinate by the reciprocal of one- 
half the width, and by scaling the y-coordinate by 
the reciprocal of one-half the height: 

fovx = A 

FOVY — =- 
H 

The FOV scale transformation matrix is shown as 
matrix (A8): 



FOVX 

0 

0 

0 




0 

FOVY 

0 

0 




0 

0 

1 

0 


(AS) 


0 

0 

0 

1 



Asymptotic /asymmetric 

projection 

trans- 


formation matrix . Sequentially multiplying the 
eight matrices presented above will generate the 
asymptotic/asymmetric projection transformation 
matrix. This matrix is shown as matrix (A9). Using 
this matrix as the projection matrix to convert the 
3-D scene data base to the 2-D perspective view used 
for the left-eye or right-eye image of the stereo pair 
is discussed in section 6.5 of reference 16: 


(SD)(FOVX) 

0 

0 

0 

0 

(SD)(FOVY) 

0 

0 

-(VPT) (FOVX) 

0 

-(ZCT)(ZCS) 

-1 

(POT) (SD) (FOVX) 

0 

-(ZCS) 

0 


(A9) 

Asymptotic Projection Mapping 

Conversions and Geometry 

The mapping of real-world distances to a stereo 
3-D virtual volume with the asymptotic/asymmetric 
projection technique requires three major real-world 
to virtual volume conversions. The first conversion 
is the mapping of the real-world infinite distance to 
the maximum view distance (MAXD) of the stereo 
3-D virtual volume, the second is that of mapping 
the real-world screen intercept (SI) distance to the 
virtual image screen distance (SD), and the third is 
mapping of the stereo 3-D virtual volume minimum 
view distance (MIND) to a real-world near clipping 
plane (NCP) value. 

Real-world infinite distance to maximum 
view distance conversion . This mapping conver- 
sion is achieved by the viewport translation step in 
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the matrix construction. The viewport translation 
value (VPT) is derived by a direct application of 
the lateral disparity equation shown in figure 4. Fig- 
ure A1 shows the geometry of the VPT derivation. 
With intcrpupillary distance IPD, screen distance 
SD, and desired maximum view distance MAXD, the 
VPT derivation is as follows: 

From similar triangles ABC and EDC , the 
relationship 


IPD VPT 

2.0 x MAXD*" MAXD - SD 


can be determined. Solving the above relationship 
for VPT, with appropriate reductions, will generate 
the VPT equation of 


VPT = 


IPD 

~~ 2 ~ 


x 


/ n SD \ 
( 10_ MAXD ) 



For a left-eye projection, the viewport, needs to be 
translated to the left, which requires a negative trans- 
lation value for the coordinate system used. Trans- 
lating the viewport centers horizontally by amount 
VPT ensures that the maximum positive lateral dis- 
parity will never exceed VPT and that no image will 
be presented farther away than MAXD. 

Screen intercept to screen distance map- 
ping. The screen intercept mapping conversion is 
controlled by the projection offset translation step 
in the matrix construction. The geometry for the 
derivation of the projection offset translation (POT) 
value that will result in the desired screen intercept 
distance is shown in figure A2. Although the center 
of projection for the left-eye view is translated from 
the center of the viewer coordinate system (point B ) 
by amount POT to point A , the result of the projec- 
tion is presented to the left eye from point D, which 
is only offset by amount VPT (this is done to ensure 
that no object will have a positive lateral disparity 
greater than VPT and no image will appear farther 
away than MAXD). Presenting the image to the eye 
with a horizontal offset less than that from which the 
image was generated produces a foreshortening effect 
that places objects in the stereo 3-D virtual image 
closer to the viewer than they are in the real world. 
This is the basis for the screen intercept mapping 
conversion. An object at point C ) which is located 
SI distance in the real-world scene directly in front of 
the viewer, will be shifted to point F in the virtual 
scene when the image is presented (lines AC and DF 
are parallel). The object will appear on the screen 
where line DF intersects the screen distance plane, 
with the lateral disparity equal to the distance be- 
tween this intersection point and the center of the 


Figure Al. Geometry of VPT derivation. 



Figure A2. Geometry of POT derivation. 

screen, point E . To make the object appear at the 
screen distance (SD) in the virtual image, it must be 
presented with no lateral disparity between the two 
eye- views. Thus, line DF must intersect the screen 
at point E . From this geometry, triangles ABC and 
DBE are similar, and the relationship of 

POT _ VPT 

~sT ” “sd“ 


III! 
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can be determined. Solving the above equation for 
POT results in 


POT = VPT x 

SD 

Inserting the equation generated for VPT earlier 
produces 


POT = 


IPD 


X 


1 . 0 - 


SD \ 
MAXD ) 


x 


SI 

SD 


The POT value for the left-eye translation is a posi- 
tive (to the right) amount. This seemingly reversed 
sign is due to the fact that in the construction of pro- 
jection matrices, the point of projection is considered 
fixed and the relative motion of the world is to the 
right for the left-eye offset. 

Minimum view distance to near clipping 
plane conversion . With the VPT and POT val- 
ues calculated, a real-world distance to virtual im- 
age distance mapping conversion can be applied with 
a guarantee that no virtual image distance will ex- 
ceed MAXD and that objects with real-world dis- 
tances of SI will appear at the plane of the screen in 
the virtual image. The remaining mapping conver- 
sion is to ensure that no object in the virtual image 
is presented closer than the minimum view distance 
(MIND). This is controlled by setting the real-world 
near clipping plane (NCP) value to a distance that 
corresponds to the MIND value in the virtual pre- 
sentation. Figure A3 shows the geometry involved 
in the NCP derivation. The lateral disparity y of an 
object perceived at distance MIND can be calculated 
with the lateral disparity equation from figure 4 with 
MIND substituted for object depth d. The equation 
would take the form 



NCP 


Figure A3. Geometry of NCP derivation. 

system. From the similar triangles ABC and DYE , 
the relationship 


NCP _ SD 
FCP ” VPT — y 


V = 


IPD 

~ 2 ~ 


x 


1.0 - 


SD \ 
MIND ) 


and the object would be drawn at point E when 
displayed at screen distance SD. With the viewport 
center at point £>, which is offset by amount VPT, 
line DE represents the projection of an object ap- 
pearing at the MIND distance in the stereo 3-D 
virtual image. To determine where the MIND dis- 
tance appears in the real-world scene, line DE is 
translated to the center of projection at point A 1 
which is offset by the POT amount. The NCP value 
is determined where this new line AC (parallel to 
DE) intersects the center of the viewer coordinate 


can be derived. By solving the above equation for 
NCP and inserting the known relationships for POT, 
VPT, and ?/, the NCP relationship can be reduced to 

_ (SI x MIND x MAXD) - (SI x SD x MIND) 

(SD x MAXD) - (SD x MIND) 

Overall geometry and distance mapping . 
One final useful derivation is the equation for the 
conversion from real-w r orld scene distance to that of 
stereo 3-D virtual volume distance. Figure A4 rep- 
resents the combination of figures A 1 to A3 and 
shows the overall general geometry of the asymp- 
totic/asymmetric projection for the left-eye view. 


13 



With screen distance SD, interpupillary distance 
IPD, and maximum screen distance MAXD, an ob- 
ject at point C, located distance d f from the viewer 
in the real-world scene, will appear at distance d in 
the stereo 3-D virtual presentation. From the geome- 
try in figure A4, by using similar triangles ABC and 
DYE and substituting the VPT, POT, and y (from 
fig. 4) relations, the stereo 3-D virtual distance d can 
be derived as a function of the real-world scene dis- 
tance d ! . This function is 

d' x SD x MAXD 

d = (d' X SD) + (SI X MAXD) - (SI x SD) 

As a proof of the validity of this equation, four tests 
may be made: (1) The limit may be taken as real- 
world scene distance d f approaches infinity. When 
this is done, d is seen to approach MAXD. (2) If the 
screen intercept value SI is inserted for d becomes 
the screen distance SD. (3) If the relationship for 
the NCP real-world scene distance is substituted for 
d ! , the equation reduces to MIND. (4) If the screen 
intercept value SI is set to the screen distance SD, 
the function reduces to 

d f x MAXD 
“ d ! + MAXD + SD 

and the limit of this function as MAXD approaches 
infinity is d ! . Thus, if the stereo 3-D virtual volume 
mapping parameters are set to equal their real-world 
counterparts (the screen intercept SI is set equal 
to the screen distance SD, and the maximum view 
distance MAXD is set to infinity), the stereo 3-D 
virtual volume distances match the real-world scene 
distances precisely. 



Figure A4. Overall geometry for asymptotic projection. 
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Appendix B 

Piecewise Linear/Asymmetric 
Projection Transformation Matrix 

The piecewise linear/asymmetric projection trans- 
formation matrix is a 4 x 4 element matrix that is 
used to convert a 3-D scene data base into a 2-D per- 
spective representation for the left-eye and right-eye 
images of a stereo pair. The matrix is built based on 
a right-handed viewer coordinate system with the ori- 
gin located at the viewpoint in the scene data base. 
The x-axis has positive direction to the right, the 
y-axis is positive up, and the 2 -axis is positive be- 
hind the viewer. All vertices in the data base are 
represented in homogeneous (x, y, 2 , w) form. The 
homogeneous coordinate w is a scale parameter and is 
initially set to 1. If, after any of the steps in the pro- 
jection transformation, the w coordinate is a value 
other than 1, it is forced to unity by dividing the 
vertex coordinates (x, y, 2 , w) by the nonunity w co- 
ordinate value. For a more complete discussion of the 
homogeneous coordinate system, and 3-D geometric 
transformations, viewing, and projections, please re- 
fer to chapters 5 and 6 in reference 16. 

To generate the two eye views, a left-eye matrix 
and a right-eye matrix are constructed for each parti- 
tion of the piecewise linear projection. The example 
on the bottom half of figure 7 contains 4 partitions. 
The first is stereo region 1, the second is the area 
bounded by the end of stereo region 1 and the begin- 
ning of stereo region 2, the third is stereo region 2, 
and the fourth extends from the end of stereo re- 
gion 2 to infinity. In this particular mapping exam- 
ple, partitions 2 and 4 will not contain stereo depth 
cues and will appear as flat postcard-like perspective 
images at discrete planes in the virtual volume. Ob- 
jects in partition 2 will appear at the screen distance 
in the virtual volume, and objects in partition 4 will 
appear at the maximum view distance of the virtual 
image. To render a stereo 3-D image with a multi- 
ple partition piecewise linear projection, the entire 
scene data base must be rendered once for each par- 
tition for each eye. The piecewise linear mapping 
example in figure 7 would require 8 scene renderings 
(4 partitions x 2 eye views). During the construc- 
tion of the matrices and the scene renderings for the 
separate partitions, the near and far clipping planes 
are used to limit the scene to the bounds of each 
partition. The near clipping plane scene distance is 
mapped to the virtual distance located at the min- 
imum view distance. The far clipping plane scene 
distance is mapped to the virtual distance located at 
the maximum view distance. Objects in the scene 
data base whose depths fall between the near clip- 


ping plane and the far clipping plane will be linearly 
projected in stereo 3-D space in the virtual image 
between the minimum view distance and the maxi- 
mum view distance. In the piecewise linear mapping 
example in figure 7, the first partition projection ma- 
trix would be built with mapping parameters of the 
near clipping plane equal to the beginning of stereo 
region 1, the minimum view distance equal to the in- 
front screen fusion limit (or to the in-front depth limit 
determined in ref. 15), the far clipping plane equal 
to the end of stereo region 1, and the maximum view 
distance equal to the screen distance. The second 
partition projection matrix would use a near clip- 
ping plane value equal to the end of stereo region 1, 
and minimum view distance equal to the screen dis- 
tance, a far clipping plane value equal to the begin- 
ning of stereo region 2, and a maximum view distance 
set equal to the screen distance. The third partition 
projection matrix would be built with a near clipping 
plane distance set equal to the beginning of stereo re- 
gion 2, a minimum view distance value set equal to 
the screen distance, a far clipping value equal to the 
end of stereo region 2, and a maximum view distance 
set equal to the behind screen fusion limit (or to the 
behind depth limit determined in ref. 15). The fourth 
and final partition projection matrix would use a near 
clipping plane value set equal to the end of stereo re- 
gion 2, a minimum view distance equal to the behind 
screen fusion limit, a far clipping plane value set to 
near infinity, and a maximum view distance set equal 
to the behind screen fusion limit. 

The piecewise linear projection matrix is also con- 
structed with four parameters that define the phys- 
ical aspects of the projection. These four physical 
parameters combined with the four mapping param- 
eters discussed above give the display designer a total 
of eight parameters to control the presentation of a 
piecewise linear /asymmetric projection. 

Physical Parameters 

Interpupillary distance . The interpupillary 
distance (IPD) is the distance between the eyes of 
the person viewing the stereo 3-D presentation. 

Screen distance . The screen distance (SD) is 
the distance from the stereo 3-D viewer to the display 
surface. 

Width . The width (W) is the physical width of 
the display surface. 

Height . The height (H) is the physical height of 
the display surface. 
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Piecewise Linear Mapping and 

Depth-Viewing Volume Parameters 

Minimum view distance . The minimum view 
distance (MIND) is the minimum distance a virtual 
image in the current partition is to appear from the 
viewer. Care must be taken so that the minimum 
view distance of the closest partition does not exceed 
the in front of screen fusion limit. 

Maximum view distance . The maximum view 
distance (MAXD) is the maximum distance a virtual 
image in the current partition is to appear from the 
viewer. Care must be taken so that the maximum 
view distance of the farthest partition does not ex- 
ceed the behind screen fusion limit. 

Near clipping plane . The near clipping plane 
(NCP) is the distance in the real-world scene of the 
current partition in front of which objects will not 
be drawn. The NCP is used in piecewise linear 
projection as the distance in the real world to be 
mapped to the minimum view distance in the stereo 
3-D virtual image. 

Far clipping plane . The far clipping plane 
(FCP) is the distance in the real-world scene of 
the current partition beyond which objects will not 
be drawn. The FCP is used in piecewise linear 
projection as the distance in the real world to be 
mapped to the maximum view distance in the stereo 
3-D virtual image. 

Matrix Construction 

The piecewise linear /asymmetric projection trans- 
formation matrix is built in eight steps by combin- 
ing basic geometric transformation matrices. Each 
of the transformations is based on values calculated 
from one or more of the aforementioned parameters. 

Linear projection offset translation . The 
first step in the construction is to offset the center- 
line of the projection horizontally to generate either a 
left-eye or a right-eye coordinate system. This trans- 
lation, in conjunction with the linear viewport sep- 
aration translation later on, is one of the two trans- 
formations controlling the slope and intercept of the 
stereo 3-D mapping for the current partition. The 
linear projection offset translation (LPOT) is calcu- 
lated based on five of the eight control parameters: 

IPD [ FCP x NCP x (MAXD - MIND) ' 

LPOT = — x [ MA xd x MIND x (FCP - NCP) 

The derivation of the LPOT equation is discussed in 
a later section. The linear projection offset trans- 
lation equation above is for the generation of the 
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left-eye coordinate system. To generate the right- 
eye coordinate system, the linear projection offset 
translation is the negative of the left-eye value. The 
translation transformation matrix used for the linear 
projection offset is shown as matrix (Bl): 

1 0 0 0 

0 10 0 

0 0 10 

LPOT 0 0 1 


z invert scale. In the right-handed coordi- 
nate system used for this construction, the posi- 
tive z-axis extends behind the viewer. In order for 
the 3-D to 2-D conversion (perspective division) to 
work correctly, the coordinate system must be a left- 
handed system with the positive 2 -axis extending in 
front of the viewer. This transformation is achieved 
by inverting the 2 -axis with a 2 -scale of —1. The 
scale transformation matrix used for this operation 
is shown as matrix (B2): 

10 0 0 

0 10 0 

0 0-10 
0 0 0 1 


Screen distance scale. When perspective divi- 
sion occurs, the 3-D data base is projected onto a 2-D 
plane. To maintain the correct stereo 3-D presenta- 
tion, this 2-D plane must be the same distance away 
as the display screen. To accomplish this, the x-y 
plane of the coordinate system is scaled by the screen 
distance. The scale transformation matrix used in 
this step is shown as matrix (B3): 

SD 0 0 0 

0 SD 0 0 

0 0 10 

0 0 0 1 


Perspective division. The perspective division 
step projects the 3-D scene data base onto a 2-D 
plane. This projection is achieved by dividing the 
x- and y-coordinates of the data base vertices by 
the vertex 2 -coordinatc value. The screen distance 
scale step above has established the 2-D plane to be 
coincident with the display screen. To accomplish 
this division with a transformation matrix, a matrix 
that transposes the homogenous coordinate w with 
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the vertex z-coordinate is used. This matrix is shown 
as matrix (B4): 


to chapter 6 of reference 16. The z-clip translation 
transformation matrix is shown as matrix (B5): 


10 0 0 
0 10 0 
0 0 0 1 
0 0 10 


(B4) 


10 0 0 

0 10 0 

0 0 10 
0 0 ZCT 1 


(B5) 


z-clip translation. After the perspective divi- 
sion operation, the x- and ^-coordinate values of the 
scene data base vertices represent the 2-D projec- 
tion. The z-coordinate value has been transformed 
to its reciprocal. Two transformations arc necessary 
to bring the z-coordinate value within the z clipping 
range. The first of these is a translation. This trans- 
lation is based on the near clipping plane and far 
clipping plane values and is the negative average of 
their reciprocals: 


ZCT = - 


FCP + NCP 
2.0 x FCP x NCP 


z-clip scale. The second of the z-coordinate 
transformations necessary to bring the z-coordinate 
value into the z clipping range is a scale transforma- 
tion. Like the z-clip translation, the 2 -clip scale is 
based upon the near and far clipping plane values, 
but it is equal to the negative reciprocal of one-half 
of the distance between their reciprocals: 


zcs = 


2.0 x FCP x NCP 
FCP - NCP 


For a discussion of the effect of the near and far clip- 
ping planes on the z-clip scale, please refer to chap- 
ter 6 of reference 16. The z-clip scale transformation 
matrix is shown as matrix (B6): 


For a discussion of the effect of the near and far 
clipping planes on the z-clip translation, please refer 


10 0 0 
0 10 0 
0 0 ZCS 0 
0 0 0 1 


(B6) 


Linear viewport separation . The separation between the left-eye and right-eye viewports and the real- 
world scene distance far clipping plane value controls the maximum positive lateral disparity for the current 
partition. As an object in the real-world scene approaches infinity, the object approaches the center of the 
projection, and the center of the projection appears in the center of the viewport. Thus, the viewport separation 
controls where objects near infinity in the real-world scene would appear in the current partition of the piecewise 
linear projection. In a piecewise linear projection, the far clipping plane value is used as the farthest distance 
in the real-world scene that is to appear in the current partition and at the maximum view distance MAXD. 
In most partitions of a piecewise linear projection, the farthest objects drawn are not near infinity, so the 
linear viewport translation amount LVPT must be calculated such that the maximum positive lateral disparity 
is generated by objects at real-world scene distances equal to the far clipping plane value, and the lateral 
disparity generated must produce objects perceived at MAXD. The linear viewport translation, along with the 
linear projection offset translation discussed earlier, is one of the two transformations controlling the slope and 
intercept of the stereo 3-D mapping of the current partition: 

MAXD x MIND x (FCP - NCP) + SD x (NCP x MAXD - FCP x MIND) ' 

MAXD x MIND x (FCP - NCP) 


IPD 

LVPT = — x 


The derivation of this equation is discussed in a later section. The equation above gives the linear viewport 
separation translation amount for the left-eye image. The right-eye value is the negative of the left-eye value. 
The linear viewport separation translation transformation matrix is shown as matrix (B7): 
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(B7) 


1 0 0 0 

0 10 0 

0 0 10 

LVPT 0 0 1 

Field- of -view scale . To maintain the proper aspect ratio and fields of view, the image plane that 
contains the 2-D projection of the 3-D scene data base must be scaled based on the physical constraints 
of the presentation. The three physical parameters that determine FOV and aspect ratio are the width and 
height of the display surface, and the viewer-to-screen distance. The screen distance effect has already been 
accounted for in the screen distance scale step. The display surface width and height must now be accounted 
for. This is accomplished by scaling the ^-coordinate by the reciprocal of one-half the width, and by scaling 
the y-coordinate by the reciprocal of one-half the height: 

f ° vx = i 

FOVY = % 

H 

The FOV scale transformation matrix is shown as matrix (B8): 

FOVX 0 0 

0 FOVY 0 

0 0 1 

0 0 0 


(B8) 


Piecewise linear/ asymmetric projection transformation matrix . Sequentially multiplying the eight 
matrices presented above will generate the piecewise linear/asymmetric projection transformation matrix. This 
matrix is shown as matrix (B9). The conversion of the 3-D scene data base to the 2-D perspective view used for 
the current partition in the left-eye and right-eye images of the stereo pair with this matrix as the projection 
matrix is as discussed in section 6.5 in reference 16: 


(SD)(FOVX) 

0 

0 

0 

0 

(SD)(FOVY) 

0 

0 

-(LVPT) (FOVX) 

0 

— (ZCT)(ZCS) 

-1 

— (LPOT) (SD) (FOVX) 

0 

-(ZCS) 

0 


Piecewise Linear Projection Mapping Conversions and Geometry 

Figure B1 represents the geometry for an object at the far clipping plane distance for one partition of 
a left-eye piecewise linear projection. The center of projection for the left-eye image is translated from the 
center of the viewer coordinate system (point B ) by the linear projection offset translation step in the matrix 
construction, by amount LPOT, to point A. For an object at point (7, which is FCP distance from the viewer 
in the real-world scene, line AC represents the projection of the objects as seen from the center of projection. 
The image is then presented to the viewer in a viewport that is offset, by the linear viewport translation step 
in the matrix construction, from the center of the viewer coordinate system by amount LVPT to point D. 
Thus, when the object is viewed it is shifted to point F (lines AC and DF are parallel) and displayed on the 
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Figure Bl. Geometry at far clipping plane. 


screen where line DF intersects the screen plane, which is SD distance from the viewer, at point E. The left 
eye then views the object with a lateral disparity of amount YF and the object appears MAXD distance from 
the viewer in the stereo 3-D virtual image. From the lateral disparity equation in figure 4, the equation for the 
object lateral disparity YF can be determined: 


YF = 


IPD 


X 



SD \ 
MAXD ) 


From similar triangles ABC and DY E, the relationship 

LPOT _ LVPT - YF 
FCP “ SD 

can be determined. Solving this equation for LPOT produces 


LPOT = 


FCP x (LVPT - YF) 
SD 


Figure B2 represents the geometry of an object at the near clipping plane distance for one partition of a 
left-eye piecewise linear projection. Like the far clipping plane geometry of figure Bl, the center of projection has 
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Figure B2. Geometry at near clipping plane. 


been translated to point A , and the viewport center has been translated to point D. Assuming the geometries 
in figures B1 and B2 arc from the same piecewise linear projection, the amounts LPOT, IPD, LVPT, and SD 
are the same between the figures. For figure B2, line AC represents the projection of an object located at 
point C, which is distance NCP from the viewer in the real-world scene. When the object is presented to the 
viewer from the center of the viewport at point D , the object is shifted to point F and is displayed on the 
screen where line DF intersects the screen plane, point E. The left eye then views the object with a lateral 
disparity of amount YN and the object appears distance MIND from the viewer in the stereo 3-D virtual 
volume. From the lateral disparity equation in figure 4, the equation for the object lateral disparity Y N can 
be determined: 


YN — 


IPD 


( „ SD \ 

( ' mind) 


From similar triangles ABC and DYE, the relationship 


LPOT LVPT - YN 
NCP “ SD 


can be determined. Solving this equation for LPOT produces 


LPOT = 


NCP x (LVPT - YN) 
SD 


ITT 
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Substituting the LPOT equation derived from figure B1 for LPOT in the equation derived from figure B2 
generates 


FCP x (LVPT - YF) NCP x (LVPT - YN) 
SD “ SD 


Substituting the equations for Y F and Y N into the above relationship and solving for LVPT will produce the 
linear viewport translation equation: 


LVPT = 


IPD 

~~ 2 ~ 


x 


MAXD x MIND x (FCP - NCP) + SD x (NCP x MAXD - FCP x MIND) 
MAXD x MIND x (FCP - NCP) 


For a left-eye projection matrix, the linear viewport translation is to the left, which requires a negative value 
for the coordinate system used. Substituting the LVPT equation above into either of the two LPOT equations, 
with appropriate reductions, will generate the linear projection offset equation 


LPOT = 


IPD 

2 


x 


FCP x NCP x (MAXD - MIND)' 
.MAXD x MIND x (FCP - NCP). 


The linear projection offset translation value for a left-eye projection matrix is to the right, a positive value. 
This seemingly reversed sign is due to the fact that in the construction of projection matrices, the point of 
projection is considered fixed and the relative motion of the world is to the right for the left eye. 


As a final derivation, the equation for the mapping of real-world scene distances to stereo 3-D piecewise 
linear partition distances can be determined from the geometry in figure B3 and from the LPOT, LVPT, and 
the figure 4 lateral disparity equation. From the geometry in figure B3, an object located at point (7, which is 
distance d! from the viewer in the real-world scene, will appear to the viewer at a distance of d in the stereo 
3-D virtual presentation. With similar triangles ABC and DYE , and the LPOT, LVPT, and y (from fig. 4) 
equations, the stereo 3-D virtual distance d can be derived as a function of the real-world scene distance d! . 
This function is 


d! x MAXD x MIND x (FCP - NCP) 
d f x (FCP x MIND - NCP x MAXD) + FCP x NCP x (MAXD - MIND) 


As a proof of the validity of this equation, two tests may be made: (1) If the real-world scene value of FCP 
is substituted for the real-world scene distance d ! , the equation reduces to MAXD. (2) If the real-world scene 
value of NCP is substituted for the real-world distance d\ the equation reduces to MIND. 
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